Fuzzy Clustering with Prototype Extraction for Census Data Analysis

نویسندگان

  • Oleg Chertov
  • Marharyta Aleksandrova
چکیده

Not long ago primary census data became available to publicity. It opened qualitatively new perspectives not only for researchers in demography and sociology, but also for those people, who somehow face processes occurring in society. In this paper authors propose using Data Mining methods for searching hidden patterns in census data. A novel clusteringbased technique is described as well. It allows determining factors which influence people behavior, in particular decisionmaking process (as an example, a decision whether to have a baby or not). Proposed technique is based on clustering a set of respondents, for whom a certain event have already happened (for instance, a baby was born), and discovering clusters' prototypes from a set of respondents, for whom this event hasn't occurred yet. By means of analyzing clusters' and their prototypes' characteristics it is possible to identify which factors influence the decision-making process. Authors also provide an experimental example of the described approach usage.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...

متن کامل

A Fuzzy Clustering Model of Data and Fuzzy c-Means

The Multiple Prototype Fuzzy Clustering Model (FCMP), introduced by Nascimento, Mirkin and Moura-Pires (1999), proposes a framework for partitional fuzzy clustering which suggests a model of how the data are generated from a cluster structure to be identi...ed. In the model, it is assumed that the membership of each entity to a cluster expresses a part of the cluster prototype re‡ected in the e...

متن کامل

A Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data

The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...

متن کامل

A Split-and-Merge Segmentation Algorithm for Line Extraction in 2-D Range Image

This paper presents a segmentation method for line extraction in 2-D range images. It uses a prototype-based fuzzy clustering algorithm in a split-and-merge framework. The split-and-merge structure allows us to use the fuzzy clustering algorithm without any previous knowledge on the number of prototypes. This algorithm aims to be used in mobile robots navigation systems for dynamic map building...

متن کامل

Greater Knowledge Extraction Based on Fuzzy Logic And GKPFCM Clustering Algorithm

This work proposes how to generate a set of fuzzy rules from a data set using a clustering algorithm, the GKPFCM. If we recommend a number of clusters, the GKPFCM identifies the location and the approximate shape of each cluster. These ones describe the relations among the variables of the data set, and they can be expressed as conditional rules such as "If/Then". The GKPFCM provides membership...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013